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ABSTRACT 



Oh' We compute the angular power spectrum Ce from 1.5 million galaxies in early SDSS data on large 



angular scales, I < 600. The data set covers about 160 square degrees, with a characteristic depth of 
order XhT 1 Gpc in the faintest (21 < r* < 22) of our four magnitude bins. Cosmological interpretations 
of these results are presented in a companion paper by Dodelson et al. (2001). The data in all four 
magnitude bins are consistent with a simple flat "concordance" model with nonlinear evolution and 
linear bias factors of order unity. Nonlinear evolution is particularly evident for the brightest galaxies. 
A series of tests suggest that systematic errors related to seeing, reddening, etc., are negligible, which 
bodes well for the sixtyfold larger sample that the SDSS is currently collecting. Uncorrelated error bars 
and well-behaved window functions make our measurements a convenient starting point for cosmological 
model fitting. 

Subject headings: large-scale structure of universe — galaxies: statistics — methods: data analysis 

1. INTRODUCTION two-dimensional (2D) power spectrum Ci from early imag- 

^ , , , • , ,,n c i • i • hig data in the Sloan Digital Sky Survey (SDSS; York et 

Galaxy clustering encodes a wealth of cosmological m- , „„„„s ™ , b . ,. . ,. ,-\ . ,. . 

c ,. -d , , . , , , i al. 2000 . Ihe angular correlation function w(0) oi this 

formation. By breaking degeneracies between cosmologi- „_„„ /. . . , . • i ^ 

, , jt^% ri u i SDSS data is presented in a companion paper by Con- 
cal parameters and by permitting powerful cross checks, ,. , , ,„£*-, \ i i ,i <■ „ i i , • 
., -i ,i i • i u i .I. nolly et al. (2001), and both of these angular clustering 
it complements other cosmological probes such as the cos- J v . '' . , „„ , ° „,,, , „ & 
i i i f^ flD \ r ii, • ii, I measures are inverted to 3D power spectra r\k) by Do- 
mic microwave background (CMB) both m theory (e.g., . . , ,___., N „, , . , v / ,/ ,, 
t^. , . , , ~, nn n\ j • L- i at j-l c u j. delson et al. (2001 . Ihe galaxies are analyzed directly 
Eisenstem et al. 1999) and m practice (e.g., Ncttcrhcld et . r , , n i , i 

al. 2001; Pryke et al. 2001; Stompor et al. 2001; Wang et TJnT^l P 7f S P ectrum Parameters b y gzalay et al. 

I 9finD (2001). Ihe data set upon which all these analyses are 

a AltriniiP-r, nnrplv anp-nlar ralaxv catalog lack the three baSed is P resented and extensively tested for systematic 

Although purely angular galaxy catalogs lack the three- ^^ Scranton rf d (20Q1 hereafter S200 l). 3D clus- 

dimensional (oD) information present in redslutt surveys, . J . , . . \ . . . .„ '. ... 

,, , j , \ 't. i-1- u t n • i termg using galaxies with measured redsmfts is studied 

they tend to be quite competitive because of their much . ° ■ , , /„^n, s A • i i tn\ ^ ■ ■ 

, i „ , . . . . . , , m , by Zehavi et al. (2001). An independent w(^)-analysis is 

greater numbers of galaxies. A case in point is the APM J , . „ \ „ ' ,„__. , ^ y ' J 

, . , ,.„ ., -,, , , ,v presented by Gaztanaga (2001). 

survey, which still provides one of the most accurate three- rm > i ?■ p ,■ ,„, , 

,. . i , , .. , i Ihe angular correlation function una) has many merits 

dimensional power spectrum measurements despite lack- ° . . v ' ■' 

ing redshift information (Efstathiou & Moody 2001). In as a mea f u f e of Mustering. It is fast to compute even for 

,,. . ., ,, , c ,i , ; ,i massive data sets, and its broad familiarity m the astro- 

this spirit, the goal of the present paper is to measure the . . .' . .... . .., ., , . , 

nomical community facilitates comparison with theoretical 



predictions as well as other observations. Notwithstand- 
ing, as detailed in Appendix A, the angular power spec- 
trum Ci has three virtues that makes it quite complemen- 
tary to w{9) and worth computing as wellrj: 

1. It is possible to produce measurements of Cg that 
have both uncorrelated errors and well-behaved win- 
dow functions. 

2. The Cg -estimators represent a lossless compression 
of the full data set in the sense that they retain all 
of its angular clustering information on large scales, 
where the Gaussian approximation applies. 

3. The Cg -coefficients are more closely related to the 3D 
power spectrum P(k) than w(0) is, in the sense of 
giving narrower window functions in fc-space (Baugh 
& Efstathiou 1994). This is an advantage for 2D i— ► 
3D inversions, since it reduces troublesome aliasing 
from small scales where nonlinear effects are difficult 
to model. 

These attractive properties have triggered a resurgence of 
interest in measuring Cg from galaxy surveys (Scharf & La- 
hav 1993; Baugh & Efstathiou 1994; Huterer et al. 2000), 
extending the pioneering work of Hauser & Peebles (1973). 

On small scales where nonlinear effects become impor- 
tant, the angular power spectrum loses much of its appeal. 
Non-Gaussian clustering introduces correlations between 
different £-bands, our method becomes computationally 
cumbersome, and much of the interesting physics takes 
place in real space rather than in Fourier space, with the 
observed clustering telling us more about halo properties 
than about the initial linear power spectrum. In summary, 
as described in Appendix A, the CV-analysis presented here 
and the w(8) analysis by Connolly et al. (2001) are highly 
complementary, with advantages on large and small scales, 
respectively. We therefore limit our analysis to large angu- 
lar scales £ < 600, corresponding to the linear and weakly 
nonlinear regime. A multipole £ corresponds roughly to an 
angular scale 9 ~ 180°/£, so our limit £ < 600 corresponds 
to a spatial scale of order 5/i _1 Mpc at the characteristic 
survey depth of l/i -1 Gpc. 

The rest of this paper is organized as follows. In Sec- 
tion S, we measure the angular power spectrum Cg and 
discuss how it is related to the underlying 3D power spec- 
trum P(k). In Section 0, we perform a range of tests and 
Monte-Carlo studies to assess the reliability of our results 
given potential problems with extinction, seeing, software 
and non-linear clustering, and summarize our conclusions. 
Two appendices discuss how our angular power spectrum 
measurements relate to the angular correlation function 
w(0) and the underlying 3D power spectrum P{k). 

21 It is worth emphasizing that although the theoretical Cg and 
w(8) are simply Fourier (more precisely Legendre) transforms of one 
another, there is no such equivalence between the measured Cg and 
w(8) because of incomplete sky coverage and other complications. 
Because different pair weightings are applied to the multitude of 
galaxies before they are compressed into the handful of Cg and w(d) 
numbers presented here and by Connolly et al. (2001), the infor- 
mation content in the two is different. Although it is possible to 
construct a lossless u>(#)-estimator that contains the same informa- 
tion as Cg , this is not desirable for the reasons described in Appendix 
A — it limits the dynamic range and it destroys a key property of 
conventional «j(0)-estimators: perfect window functions, i.e., the es- 
timated correlation at separation 6 probes only correlations on that 
scale. 



2. THE ANGULAR POWER SPECTRUM 
2.1. Data 

This paper builds on the foundation laid by S2001, 
which produces a galaxy sample demonstrated to be of 
sufficient quality to permit a large-scale angular cluster- 
ing analysis not dominated by systematic errors. We use 
the "EDR-P" sample described by S2001 for our analysis, 
which stands for early data release (Stoughton et al. 2001) 
with galaxy probabilities used in place of rigid counts^. 
It consists of galaxies in the 2.5° x 90° equatorial stripe 
145° < a 20 oo < 235°, -1.25° < <5 2 ooo < 1-25° with re- 
gions of high extinction and poor seeing discarded. We 
measured fluxes with the r filter. The r magnitude is de- 
fined by Fukugita et al. (1996), Stoughton et al. (2001). 
As in S2001, we analyze four subsamples of the galaxies 
separately, corresponding to ranges of model magnitude 
r* of 18-19, 19-20, 20-21 and 21-22, respectively. These 
four samples consist of effectively N =57,781, 158,636, 
428,920 and 886,936 galaxies, respectively, with assumed 
mean redshifts of 0.26, 0.36, 0.50 and 0.64, respectively. 
Assuming a flat 51a = 0.7 cosmology, this corresponds to 
mean comoving distances of 0.51, 0.71, 0. 95 and 1.19 
/i _1 Gpc, respectively. 

A set of powerful tools for angular power spectrum es- 
timation has been developed in the CMB community, and 
to take advantage of this, we begin by re-expressing our 
galaxy analysis problem in a form analogous to the CMB 
case. We do this by dividing our sky patch into N square 
"pixels" of side 12.5 arcminutes and computing the density 
fluctuation 

Xi = ^-1 (1) 

m 

in each one. Here rii is the observed number of galaxies 
in each pixel and hi is the expected number, taking into 
account the slight spatial variations in completeness as in 
S2001. The choice of 12.5' for the pixel height is convenient 
since it correspoonds the height of an SDSS camera column 
(Gunn et al. 1998), thereby maximizing the sensitivity of 
our tests for weather-related systematics (even and odd 
columns are observed on separate occasions). There are 
3695 pixels in each of the three brightest magnitude bins 
and 3274 in the 21 < r* < 22 bin where the seeing cuts 
were more stringent, corresponding to sky areas of 160 and 
142 square degrees, respectively. 

In the context of previous large angular surveys of galax- 
ies, the main advantage of our data set is its superior pho- 
tometric accuracy. Its main drawback is that it subtends 
less area than both the APM and EDSGC surveys, which 
covered 5000 and 1000 square degrees, respectively (see Ef- 
stathiou & Moody 2001; Huterer et al. 2000). This weakess 
is partly compensated by going deeper (our sample of 1.5 
million galaxies is about half that of APM and 50% larger 
than that of EDSGC) and is of course only temporary, 
since the SDSS will ultimately cover 10 4 square degrees. 

22 As detailed by S2001, each object is assigned a probability be- 
tween zero and one that it is a galaxy based on its observed proper- 
ties. Throughout this paper, we use the sum of these probabilities as 
our estimate of the number of galaxies in a given region. This is more 
accurate than a strict object-by-object maximum-likelihood classifi- 
cation — for instance, if ten objects each have a 10% probability of 
being a galaxy, classifying them all as stars would underestimate the 
true galaxy count by one. 



2.2. The basic problem 
Given a pixelized map Xi and associated shot noise error 

_— 1/2 

bars n i , we compute the angular power spectrum with 
the quadratic estimator method (Tegmark 1997; Bond et 
al. 2000), using KL-compression to accelerate the process 
(Bond 1994; Bunn 1995; Vogeley & Szalay 1996). Since 
this procedure has been described in detail in the recent 
literature (see Tegmark & de Oliveira-Costa 2001 for a 
recent review using our present notation and Huterer et 
al. (2001) for a recent application to galaxy clustering), 
we summarize the method only very briefly here. 

We group our angular density fluctuation map pixels 
Xi into an TV-dimensional vector x. The vector x has a 
vanishing expectation value ((x) = 0) by construction, and 
we can write its covariance matrix as 



C = 



= S + N, 



s = j2pi p i 



(2) 



for a set of angular power spectrum parameters pi and 
known matrices Pi that are given by the map geometry in 
terms of Legendre polynomials. N denotes the contribu- 
tion from shot noise, and is a known diagonal matrix. We 
parametrize the angular power spectrum 



1(1 + 1) 
2tt 



C t 



(3) 



(customarily denoted 8Tf in the CMB literature) as piece- 
wise constant in 50 bands of width A£ = 20, with height 
Pi in the i th band, a?, which is a dimensionless number, 
can roughly be interpreted as the rms fluctuation level on 
the angular scale 8 ~ 180°/^. In summary, knowing the 
power spectrum parameters pi would allow us to predict 
the theoretical covariance matrix of our data via equa- 
tion (0). Our problem is to do the opposite, and estimate 
the parameters pi using the observed data vector x. 
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Fig. 1. — The triangles show the KL-coefBcients y; for the 
21 < r* < 22 magnitude bin (2693 in top panel, first 100 in bot- 
tom panel). If there were no clustering in the survey, merely shot 
noise, they would have unit variance, and about 68% of them would 
be expected to lie between the two horizontal lines (in the band 
— 1 < 3/i < 1). Cosmological fluctuations in the data would increase 
the standard deviation, as indicated by the other thin curves. From 
inside out, the thin curves correspond to the theoretically predicted 
rms fluctuation level y/1 + A^ with our prior power spectrum renor- 
malized by factors 0, (f/3) 2 , (2/3) 2 , 1 and (4/3) 2 , respectively. The 
green/grey curve is the rms of the data points yi , averaged in bands 
of width 25, and shows the cosmological fluctuation signal rising to 
the left. As a rule of thumb, the i th KL-mode probes angular scales 
I ~ (i/jsky) 1 ' 2 ~ 16v% as indicated by upper axis labels, where 
/sky ~ 0.004 is the sky fraction covered in this survey (Tegmark 
1997). Note that green/grey curves nearly match the power per 
mode predicted by the prior model curve (normalization factor 1, 
heavy curve), showing that this model is good for estimating errors. 



2.3. KL-compression 

Since the power spectrum estimation in the next subsec- 
tion involves repeatedly multiplying and inverting N x N 
matrices, and each such manipulation requires of order iV 3 
operations, we apply a data-compression step that reduces 
the size of our data set. We employ the Karhunen-Loeve 
(KL) compression method (Karhuncn 1947; Bond 1995; 
Bunn & Sugiyama 1995; Vogeley & Szalay 1996; Tegmark 
et al. 1997; Szalay et al. 2001), which compresses the in- 
formation content of a map into the first part of a vector 
y = B*x, where B is an N x N matrix whose i th column 
hi satisfies the generalized eigenvalue equation 



Sb, = XiNbi 



(4) 



normalized so that b*Nb^ = 1 and sorted by decreasing 
Xi. The N numbers t/j are uncorrelated, i.e., 



(y i y j ) = \> t i (N + S)b j = (l + X i )5, 



• j ■■ 



(5) 



and their variance (yf) has a contribution of 1 from noise 
and Xi from signal. This means that the eigenvalue Xi 
can be interpreted as a signal-to-noise ratio for y,. The 
first 500 of these numbers yt (KL-coefficients) are shown 
in Figure [j] for the 21<r*<22 band, and it is seen that 



most of the cosmological signal is contained in the first 
few hundred modes. We discard all modes with signal- 
to-noise ratio Xi below unity, which leaves us with 1255, 
1656, 2510 and 2693 modes for the four magnitude bands, 
respectively. This KL-expansion is useful not only to save 
time, but also for systematic error checks. Figure H shows 
that none of the modes deviates from zero by a surprisingly 
large amount (for instance, out of the first 100 modes, typ- 
ically only 5 should deviate by 2c and none by 3c) . A sim- 
ilar KL-compression is performed in Szalay et al. (2001), 
where parameters of the 3D power spectrum are measured 
directly from the KL modes. 2D images of KL-modes for 
a rectangular strip are plotted by Tegmark (1997) and 
Szalay et al. (2001), illustrating that they tend to probe 
progressively smaller angular scales. 

2.4. Integral constraint 

An important complication when computing clustering 
on large scales is the so-called integral constraint. Since 
the mean galaxy density n is a priori unknown, it must 
be estimated from the data itself, implicitly forcing the 
vector x to have zero mean. We tackle this problem by 
only using modes that are orthogonal to the (completely 
unknown) mean, i.e., to the vector e = (1, !,...,!) corre- 



sponding to a constant offset in the map. This idea goes 
back to Fisher et al. (f993) and becomes very simple to 
implement for our pixelized case (Tegmark et al. 1998). In 
principle, it suffices to add a very large noise to the mean 
mode, i.e., to add a huge number M times ee* to the noise 
matrix N, and the subsequent KL-compression will auto- 
matically relegate the mean mode to the list of useless ones 
to be discarded. In practice, we remove the mean mode 
analytically as described in Appendix B of Tegmark et 
al. (1998), which corresponds to the limit where the huge 
number M — > oo. 

2.5. Basic results 

Once our data and the corresponding matrices have been 
KL-compressed (in which x gets replaced by y = B' x, P, 
gets replaced by B'*PjB', N gets replaced by B''NB' = 
I, where the rectangular matrix B' denotes the left part 
of the square matrix B corresponding to the KL column 
vectors we wish to keep) , we proceed to compute quadratic 
estimators pi of our power spectrum parameters pi . The 
results are shown in figures g and g and are listed in Table 
1. 

Since it is important for the interpretation, let us briefly 
review how these measurements are computed from the 
input data, in this case the vector y of KL-modes. A 
quadratic estimator p, is simply a quadratic function of 
the data vector, so the most general unbiased case can be 
written as 

Pi=y t Qiy-s i , (6) 

where the Qi are arbitrary symmetric N x iV-dimensional 
matrices and the Sj = tr [QjN] are the shot noise contribu- 
tions. Grouping the parameters pi and the estimators pi 
into vectors denoted p and p, the expected measurement 
is 



Wp 



(7) 



for a window matrix W that can be computed from the Q^- 
matrices and the sky geometry alone (Wy = tr[PjQj]). 
The Q-matrices are normalized so that each row of the 
window matrix sums to unity. This enables us to interpret 
each band power measurement pi as a weighted average of 
the true power spectrum pj , the elements of the i th row of 
W giving the weights (the "window function" ) . 

The basic idea with quadratic estimators is that each 
matrix Qj can be chosen to effectively Fourier transform 
the sky map, square the Fourier modes in the i th power 
spectrum band and average the results together, thereby 
probing the power spectrum on that scale. We use the 
particular choice of Q-matrices advocated by Tegmark & 
Hamilton (1998) (see Tegmark & Oliveira-Costa 2001 for 
a treatment conforming to our notation) , described in Ap- 
pendix A, which has the advantage of making the error 
bars on the measurements uncorrelated. In other words, 
the covariance matrix for the measured vector p is diag- 
onal (combining shot noise and sample variance errors), 
so it is completely characterized by its diagonal elements, 
given by the error bars in Table 1 and Figure 0. This co- 
variance matrix M = (pp*) — (p)(p)* is generally given 
by My = 2tr[QiCQjC] for the Gaussian case, and our 
particular choice of Q-matrices thus reduces it to a diag- 
onal matrix M,-j = Sjj(Api) 2 . C of course depends on 
p through equation (g), and when computing M to ob- 



tain our error bars Api, we use the "prior" power spectra 
described below, smooth curves fitting our measurements. 

The window functions corresponding to our 50 band 
power measurements (the rows of the matrix W) are plot- 
ted in Figure for the faintest magnitude bin. This con- 
nects our measurements pi to the binned underlying power 
spectrum a\. The windows are seen to have a character- 
istic width of order A£ ~ 50, which is determined by the 
size of our sky patch in the narrowest direction (Tegmark 
1997). We are thus unable to resolve the angular power 
spectrum finer than this because our survey strip is so nar- 
row in the declination direction, limiting the ^-resolution 
to of order A£ ~ 180°/2.5°. Figure also shows a no- 
table transition around £ = 600. This coincides with the 
angular scale where the cosmological fluctuations drop be- 
low Poissonian shot noise fluctuations, and has a simple 
interpretation. On the larger scales where shot noise is 
less important, the Q-matrices weight the galaxies in such 
a way as to make the window functions narrow, thereby 
minimizing the sample variance contribution to the er- 
ror bars caused by power aliased from other scales. On 
smaller scales, the Q-matrices weight all areas of the map 
essentially equally, without bothering with niceties such 
as apodization (down- weighting parts near edges), in an 
attempt to minimize the all-dominating shot noise. This 
results in less well-behaved window functions, which are 
both broader and are seen to have a "red leak" of power 
from substantially larger scales. Since the measurements 
beyond this transition regime are noise dominated and 
contain very little information, producing mere upper lim- 
its, we simply discard them. This cutoff corresponds to 
£ =500, 500, 600 and 700 in the four magnitude bins, re- 
spectively — note that shot noise dominates the brighter 
magnitude bins at lower £, since they contain fewer galax- 
ies. 

To improve the signal-to-noise ratio, we average these 
measurements into bands as specified in Table 1. Since 
the original measurements are uncorrelated, so are these 
averages. The corresponding 14 x 50 window function ma- 
trices for each magnitude bin, which arc necessary for com- 
paring our measurements with theoretical predictions, will 
be publishe d electronically with this article and are al so 
available at \ittp:/ /www. hep. upenn.edu/~max/sdss.htm . 



2.6. Fits and priors 

As mentioned above, we need to use a prior power spec- 
trum consistent with the data to compute accurate error 
bars. To avoid the prior acquiring spurious wiggles caused 
by over-fitting noise fluctuations, it is desirable to use a 
smooth curve with as few tunable parameters as possible 
that nonetheless is consistent with the final measurements. 
As seen in Figure g and Figure ||, the simple "concor- 
dance" model from Wang et al. (2001) provides a good fit 
to the data in all four magnitude bins if we use bias fac- 
tors b = 1.0, 0.9, 0.85 and 0.8, respectively, so we use these 
power spectra as priors. This is a flat neutrino- free model 
with purely scalar adiabatic fluctuations, a cosmological 
constant J7a = 0.66, baryon density h 2 fli, — 0.02, Hubble 
parameter h = 0.64 and spectral index n s = 0.93, nor- 
malized so that linear og = 0.9 for the dark matter. This 
model is well fit by a simple untilted BBKS power spec- 
trum (Bardeen et al. 1986), parameterized by horizontal 
and vertical scaling factors T and os as in Szalay et al. 




SOO 400 

Multipole 1 

Fig. 2. — The angular power spectrum a e = [(.{(. + l)C(/2ir] 1 ' 2 is 
shown for the four magnitude bins. The horizontal location of each 
point and the associated horizontal bars correspond to the mean 
and rms width of the corresponding window function. These mea- 
surements are uncorrelated in the approximation of Gaussian fluc- 
tuations. The curves are the "prior" power spectra used, i.e., the 
concordance model from Wang et al. (2001) with (solid) and with- 
out (dashed) nonlinear evolution, using four separate bias factors of 
order unity as described in the text. 
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Fig. 3. — Same as previous figure, but with logarithmic axes and 
for Ci rather than &£. Although logarithmic axes make window 
functions more difficult to interpret, it facilitates connecting to the 
underlying 3D power spectrum P(k), which is very crudely speaking 
the same curve shifted vertically and horizontally with different axis 
labels. The shifts depend on the magnitude bin: the fainter (and on 
average more distant) the galaxies, the further up and to the left the 
curve should be shifted — up because there is more averaging along 
the line of sight which suppresses fluctuations, to the left because a 
given angular scale £ corresponds to larger spatial scales. The solid 
lines of slope —3 indicate the direction of this shift when the mean 
survey depth is changed. In the absence of relative bias, this shifting 
should place the four curves on top of each other. 



(2001), using (r,cr 8 ) = (0.15,0.9). 

We have corrected for non-linear evolution using the 
Hamilton et al. (1991) approximation as implemented in 
Jain et al. (1996). Figure shows nonlinear evolution to 
be quite important, especially for the brighter galaxies, 
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Fig. 4. — Sample window functions are shown for the band-power 
measurements in the magnitude bin 21 < r* < 22. These are the 
rows of the window matrix W, and connect our band-power mea- 
surements pi to the underlying power spectrum a?. 



with the corresponding linear model substantially under- 
predicting the power on small scales. In Section 3.2 be- 



low, we will see that the central limit theorem nonetheless 
produces a fairly Gaussian 2-dimensional projected galaxy 
distribution because of averaging along the line of sight. 

We use this cosmological model merely for a convenient 
parametrization of our prior - - physical interpretation 
must take into account selection function uncertainties, 
etc., and the reader is referred to Dodelson et al. (2001) 
and Szalay et al. for a detailed treatment of this. The 
slight differences in normalization may reflect clustering 
evolution, differences in bias properties between the four 
samples or some combination thereof. 

On angular scales much smaller than a radian (the 
small-angle approximation), the slope n of a power-law 
angular power spectrum Cg is related to the power law 
slope m of the angular correlation function w(6) — 9 m 
by m + n = —2, so the typical power law slopes of order 
n ~ —1.5 in Figure |2| correspond to correlation function 
slopes of order m ~ —0.5, in good agreement with the 
w(6) measurement in Connolly (2001). 

2.7. Relation to 3D power spectrum 

Let us conclude this section by briefly commenting on 
how to interpret our measurements. In a companion paper 
(Dodelson et al. 2001), the present results and those on 
the angular correlation function from S2001 are used to 
recover an estimated 3D power spectrum P(k). Here we 
present the relevant window functions that are used as a 
starting point for such analyses. 

As described by Huterer et al. (2001) and Appendix B, 
the angular power spectrum Ce is related to the 3D power 
spectrum P{k) via the simple relation 



Ct 



f e (k) 2 P(k)k 2 dk, 



where the dimensionless function 
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Fig. 5. — The solid curves show the exact fc-space window function 
corresponding to C e for multipoles I = 1, 2, 4, 8, 16, 32, 64, 128, 
256 and 512, respectively, renormalized to have unit area, using 
the radial selection function for magnitude bin 21 < r < 22. The 
dashed curves show the same window functions computed in the 
small-angle approximation. These window functions connect the 
angular power spectrum Cg to the underlying 3D power spectrum 
P(k) via equation (H). 



Here / is the probability distribution for the comoving 
distance r to a random galaxy in the survey, optionally 
weighted by an evolution factor, and ji is a spherical Bessel 
function. In other words, the integral kernel transform- 
ing from 3D to our angular 2D case is simply a Bessel- 
transform of the radial selection function. A sample of 
these integral kernels are plotted in Figure h. Accurate 
approximations of this kernel are available in the small- 
angle limit, but we use the full expression here since it 
is so simple (computational details are given in Appendix 
B), and since scales where sky-curvature is non- negligible 
will eventually be well probed by the SDSS. 

By taking linear combinations of the kernels from Fig- 
ure H corresponding to our ^-space window functions, we 
obtain the kernels of Figure H, showing which k- values each 
of our band-power measurements is probing. This enables 
us to interpret our band-powers as measuring weighted 
averages of the 3D power spectrum P(k) as shown in Fig- 
ure pi. This plot is by no means a substitute for a thorough 
reconstruction of the 3D power spectrum as in Dodelson 
et al. (2001), incorporating selection function uncertainties 
etc, but provides a useful rough guide as to which spatial 
scales are probed and, in particular, as to the ^-values for 
each magnitude bin beyond which nonlinear clustering is 
likely to be important. 

To gain further intuition about the relation between Ct 
and P(k), an additional approximation is instructive. As 
shown in Appendix B, the 2D and 3D power spectra are 
approximately related by 
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Ct « ^P(k), 



(10) 



Fig. 6. — The curves show the fc-values probed by our 14 band 
power measurements for the faintest magnitude bin, thereby con- 
necting what we measure to the 3D power spectrum P{k ) In other 
words, these window functions, defined by equation (Bll), are anal- 
ogous to those in Figure M, but in fc-space rather than f-space. 



where k = ftl/r* and r* is the mean spatial depth of the 
survey. The key approximation made here is that Cg in 
fact probes not simply the power P at wavenumber k, but 
rather a weighted average of P with a window function 
of width Afc ss 7/0. Here a, (3 and 7 are dimensionless 
constants of order unity that depend only on the shape 
of the radial selection function, not on its depth. For the 
SDSS case described in Dodelson et al. (2001), a « 0.75, 
j3 rs 1.1 and the smoothing width 7 = Ak/k « 0.37. 

In other words, we can interpret Ci as a smoothed ver- 
sion of P(k) shifted vertically and horizont ally in a log-log 
plot such as Figure |[ Moreover, equation ( J10| ) shows that 
mis-estimates of the radial selection function depth r* will 
simply shift the entire P(fc)-curve along the lines of slope 
—3 shown in Figure u\ without changing its shape. 

3. ROBUSTNESS AND LIMITATIONS OF RESULTS 

How reliable are the angular power spectrum measure- 
ments computed above? In this section, we discuss the un- 
derlying assumptions and their limitations. We focus on 
three areas and discuss them in turn: potential problems 
with the input data, potential problems with the data pro- 
cessing (analysis algorithms/software) and potential prob- 
lems with underlying assumptions, notably Gaussianity. 

3.1. Issues related to the input data 

The input data used in our analysis have been exten- 
sively tested for potential systematic errors by S2001, and 
constitute arguably the cleanest deep angular survey data 
to date. In particular, S2001 present a battery of tests for 
problems involving star-galaxy separation and modulation 
of the galaxy detection efficiency by external effects such 
as photometric calibration, seeing conditions and Galactic 
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Fig. 7. — The same band power measurements as in Figure 13, but 
plotted in fc-space using the window functions from Figure pi Specif- 
ically, the data points are the rescaled band power coeffici ents w hich 
probe a weighted average of P(k) as specified by equation (Bll ). An 
inversion from 2D to 3D power spectra is performed in Dodelson et 
al. (2001) — the data plotted here are merely the input to those 
calculations. For comparison, the solid curve is the ACDM "con- 
cordance" model from Wang et al. (2001) with (solid) and without 
(dashed) nonlinear evolution. If the mean depth has been under- 
estimated for one of the Galaxy samples, the corresponding points 
should shift up to the left along the dotted lines of slope —3. 



extinction. By cross-correlating the galaxy maps with vari- 
ous two-dimensional "trouble templates" corresponding to 
variations in seeing, reddening, stellar density, camera col- 
umn structure, etc., the various effects were quantified and 
reduced to negligible levels by sharpening the seeing and 
reddening cuts. This gives us confidence that the errors in 
our star-galaxy separation algorithm (which depends on 
seeing) and reddening estimates have negligible effect on 
our estimates of angular power spectra even in the faintest 
magnitude bin. 

As an additional precaution, we complement the tests 
from S2001 with three that are tailored for our C^-analysis. 
Specifically, we compute the angular power spectra of the 
seeing and reddening templates, which were found to be 
the most serious challenges in S2001, and with a photo- 
metric calibration error template. Strictly speaking, these 
of course do not have well-defined power spectra, since 
they are not isotropic random fields. Rather, what is rel- 
evant here is the amplitude and shape of the bias that 
they would add to our estimates of the galaxy power spec- 
tra. We therefore process these templates in exactly the 
same way as the galaxy maps, with the pair-weightings 
(the Qi-matrices) given by the galaxy noise and signal ma- 
trices. We use the weighting and sky mask corresponding 
to the faintest magnitude bin, since this is the one that 
is most vulnerable to these systematics — both because 
these galaxies have the poorest signal-to-noise ratio in the 
CCD photometry (Lupton et al. 2001) and because they 
have the lowest intrinsic angular clustering amplitude. 

We use the same seeing and reddening templates as 



S2001, i.e., the second moment of the point-spread func- 
tion for each pixel and the extinction correction from 
Schlegel et al. (1998). In order to provide a meaningful 
comparison between the amplitudes of signal and system- 
atics, we need to estimate the conversion factor from see- 
ing or reddening power to galaxy fluctuation power. We 
do this using the cross-correlations presented in figures 8 
and 9 of S2001. To be conservative and err on the side 
of caution, we use the relevant 2a cross-correlation upper 
limits, 0.0017 and 0.0038, for seeing and reddening, re- 
spectively. These values are the largest upper limit on any 
angular scale, but we have used them at all angular scales 
I to be conservative. 

The corresponding angular power spectra for seeing and 
extinction are shown in Figure || and, as opposed to the 
galaxy fluctuations, they are seen be flat or rise towards 
larger angular scales. For the reddening case, this is in 
good agreement with the findings of Vogeley (1998) and 
measurements of the dust power spectrum. The combined 
DIRBE and IRAS dust maps suggest a power law Ci ex 
£~ 2 - 5 (Schlegel et al. 1998), and a recent analysis of the 
DIRBE maps has supported an even redder slope with an 
£' 3 power law for I < 300 (Wright 1998). 

As a template for photometric calibration errors, we 
identify a feature of the stellar distribution in color space 
and measure it as a function of position in the sky. As 
seen, e.g., in Finlator et al. (2000), the locus of stars in 
the g — r,r — i color plane shows two branches: stars cooler 
than ~M0 have almost constant g — r colors, while hot- 
ter stars show a strong correlation between the g — r and 
r — i color. The crossing point of linear fits to the stellar 
locus in these two branches should be independent of posi- 
tion on the sky, thus variations in this crossing point are a 
sensitive measure of photometric calibration errors in g, r 
and i. Similarly, the stellar locus in the r — i, i — z plane is 
almost linear; one can define the i — z color corresponding 
to the r — i color of the crossing point measured from the 
g — r,r — i plane. 

We have measured the crossing colors from the stars in 
our sample on scales of two degrees by 13 arcminute (the 
width of a scanline), and attribute all observed variations 
to errors in r* to be conservative; the distribution of the 
implied error is roughly Gaussian, with a sigma of 0.015 
magnitudes. We convert these r* -fluctuations into den- 
sity fluctuations by multiplying by the source count slope 
din N/ dr. This slope is of order unity at r* = 19 and 
flattens at fainter magnitudes (Yasuda et al. 2001), so we 
make the conservative assumption dlnN/dr = 1. The an- 
gular power spectra of these two calibration error maps 
are shown in Figure H and are seen to be approximately 
flat (scale- invariant). 

the almost horizontal part towards the left, and stars 
later than ~M0 are found in the vertical branch with g' — 
r' ~ 1.4 

It is reassuring that even with the extremely pessimistic 
assumptions described above, the expected contaminant 
signals remain much smaller than the observed galaxy 
power spectrum all the way out to the largest scales cur- 
rently probed. Extrapolation to extremely large scales 
suggests that even extinction should remain subdominant 
for I > 5 - 10. 

In summary, we have found that systematics are small 
even in the nearly worst-case scenario shown in Figure pi 
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Fig. 8. — To give a feeling for the magnitude of potential system- 
atic errors, the power spectrum in the faintest (and most vulnerable) 
magnitude bin is compared with the power spectra for seeing, ex- 
tinction and photometry related modulations of the observed galaxy 
density under very pessimistic assumptions. 



Moreover, the SDSS will provide other internal checks on 
many systematics, notably the extinction correction, so 
these arc unlikely to prove a significant limitation on de- 
termining the power spectrum. 

3.2. Issues related to algorithms, software and 
assumptions 

Since our analysis consists of a number of somewhat 
complicated steps, it is important to test the integrity of 
both the software and the underlying methods. We do this 
using two types of Monte Carlo simulations: 

1. We analyze 1000 Monte-Carlo maps x that are drawn 
from a multivariate Gaussian distribution with van- 
ishing mean and covariance matrix C. 

2. We analyze 100 Monte-Carlo galaxy samples includ- 
ing non-linear clustering as described in Scoccimarro 
& Sheth (2001) and S2001. 

Both sets of mock data were processed through our anal- 
ysis pipeline, enabling us to check not only whether we 
obtained the correct answer on average, but also whether 
the scatter and the error correlations corresponded to the 
predicted values. The first suite of Monte Carlos offered 
precision end-to-end tests of the algorithms and the soft- 
ware, since errors or bugs in any of the many intermediate 
steps would have manifested themselves here. They used 
the exact same survey geometry as the real data, including 
the seeing and reddening masks of S2001. 

The second suite of Monte Carlos provides a way of 
quantifying the limits of applicability of the Gaussian 
assumption. They were constructed using the PTHalos 
code (Scoccimarro & Sheth 2001) as described in detail in 
Scranton et al (2001), covering a rectangular 90° x 2.5° sky 
region. In short, this code is a fast approximate method 



Fig. 9. — The simulated kurtosis of the first 1957 KL-coefncicnts 
is shown for the 18 < r < 19 magnitude bin. Each triangle repre- 
sents the kurtosis of the distribution for the corresponding mode as 
measured from the 100 nonlinear Monte Carlo simulations described 
in the text. The thick curve shows a running average of 25 modes, 
whereas the thin horizontal line shows the Gaussian prediction of 
three. 



to build non-Gaussian density fields with realistic corre- 
lation functions, including non-trivial galaxy biasing (ob- 
tained by placing galaxies within dark matter halos with 
a prescribed halo occupation number as a function of halo 
mass). 

The non-Gaussian effects produced by non-linear evolu- 
tion encode information that can be captured by measur- 
ing higher-order moments and other statistics. Since this 
route is explored in detail in Szapudi et al. (2001), we will 
not pursue it here. However, we need to quantify the level 
to which this non-Gaussianity affects our results. 

Since our power spectrum estimates are simply quadratic 
functions of the density field, they give unbiased mea- 
surements of the underlying power spectrum even if the 
fluctuations are non-Gaussian. In other words, our calcu- 
lation of window functions, KL-modes etc. is completely 
general and does not make any assumptions about Gaus- 
sianity. The one place in the quadratic estimator formal- 
ism where Gaussianity is assumed is in the computation 
of error bars. Since the variances of our power spectrum 
estimates involve fourth moments of the observed density 
fields (kurtosis), they will generally differ from the Gaus- 
sian prediction in the presence of non-Gaussianity — typi- 
cally by being larger. The covariance between band power 
estimates likewise involves fourth moments, so we should 
not expect our error bars to retain their attractive prop- 
erty of being uncorrelated down into the nonlinear regime. 
The third moment (skewness) of the galaxy distribution 
also affects the power spectrum error bars via coupling to 
the Poissonian shot noise, at a level of the same order of 
magnitude as the kurtosis. 

To quantify this effect, Figure shows the kurtosis of 
the first 1957 KL-coefficients for the 18 < r < 19 magni- 



tude bin. (This brightest magnitude bin is expected to be 
the most non-Gaussian, both because it probes the small- 
est spatial scales and because it involves the least amount 
of line-of-sight averaging — such averaging makes the den- 
sity field mory Gaussian as per the central limit theorem .) 
The kurtosis was computed by processing the f 00 nonlin- 
ear Monte Carlo simulations through our analysis pipeline 
and computing the variance and fourth moment of the 100 
values obtained for each mode. The dimensionless kurtosis 
plotted is the fourth moment divided by the square of the 
variance, i.e., (yt)/(yf} 2 , and would equal three for Gaus- 
sian fluctuations (in which case the KL-coefficients would 
be simply independent Gaussian random variables). Since 
we have only 100 simulations, there is still a fair amount 
of scatter. To further reduce the scatter, we have therefore 
added a line in Figure showing a running average of 25 
consecutive triangles. The scatter (which is determined 
by eighth moments) appears to rise somewhat initially, as 
modes probe progressively smaller angular scales, then de- 
creases again as cosmological fluctuations become smaller 
than Poisson shot noise fluctuations. However, the kurto- 
sis itself, which (with the skewness) is the only quantity 
that affects our error bars, is seen not to depart signifi- 
cantly from the Gaussian value on any of the angular scales 
we have probed. This implies that non-Gaussianity does 
not appear to have a major impact on our results. This is 
partly by design, since we chose to focus our analysis on 
the largest scales. 

In other words, although non-Gaussian effects are very 
strong on small scales (indeed, the onset of non-linear evo- 
lution is evident in Figure 0), they have only a weak ef- 
fect on the error bars of our large scale angular power 
spectrum. Factors contributing to this are the dominance 
of shot noise on small scales (on a mode-by-mode basis) 
as well as the central limit theorem, suppressing non- 
Gaussianity by averaging fluctuations along the line of 
sight. 

In conclusion, we have computed the large-scale angu- 
lar power spectrum from early SDSS data and performed 
a series of tests validating our results. The cosmological 
implications of our measurements are discussed in a com- 
panion paper by Dodelson et al. (2001). Although these 
results are interesting in their own right, perhaps the most 
important conclusion is that the lack of discernible sys- 
tematic errors even on scales as large as tens of degrees 
bodes extremely well for analysis of future SDSS data. 
The present data covered about 160 square degrees, i.e., 
less than 2% of the full survey that will eventually be avail- 
able, so angular clustering studies are likely to remain at 
the forefront of the quest for a detailed understanding of 
cosmic clustering. 
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APPENDIX A 

THE RELATION BETWEEN 
DIFFERENT QUADRATIC ESTIMATORS 

The purpose of this Appendix is to describe the Q- 
matriccs that define our analysis as well as to elucidate 
the relationship between quadratic estimators of Ci, w(9) 
and P{k). From an information-theoretic point of view, we 
will see that the key issue is not which of the three func- 
tions one tries to measure, but what pair weighting is used 
in the process — the minimum-variance weighting retains 
all information about all three of them in the Gaussian 
approximation. Indeed, we will see that the decorrelated 
minimum- variance estimators of all three functions are one 
and the same set of numbers, just normalized differently! 

A.l. The (^-matrices used in our analysis 



As described in Section 2.5 



our power spectrum es- 
timators are quadratic functions of the observed galaxy 
density. The estimator of the power in the i th band is 
therefore defined by a symmetric matrix Q; that gives the 
weight assigned to each pair of pixels (or KL-coefficients) 
via equation (1q). In our analysis, we make the choice 



Q 4 =53(B)«C- 1 P i C" 



(Al) 



for a 50 x 50 matrix B that will be defined below. It can 
be shown (Tegmark 1997) that this choice distills all the 
cosmological information from the original galaxy map x 
into the (much shorter) vector p in the approximation of 
Gaussian fluctuations, as long as the matrix B is invertible 
and the binning scale A£ is narrower than the scale on 
which the power spectrum varies substantially. In this 
approximation, the mean and covariance of the quadratic 
estimator vector p defined by equation (o) is given by 



M 



where 



(PP 



1 



(p) - BFp, 

(p)(p)' = BFB', 



F l3 = -tr [C-'PiC-'Pj] 



(A2) 

(A3) 

(A4) 



is the so-called Fisher information matrix. As advocated 
in Tegmark & Hamilton (1998), we choose B = DF^ 1 / 2 , 
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where D is a diagonal matrix whose elements are chosen 
so that the window matrix W = BF has unit row sums. 
This choice has the virtue of giving uncorrelated error bars 
(the covariance matrix of equation (jA3) becomes the diag- 
onal matrix M = D 2 ) and narrow, well-behaved window 
functions as seen in Figure 0. 

A. 2. The relation between quadratic estimators ofC'i, 
w{9) and P(k) 

Suppose the angular power spectrum parameters pi can 
be expressed as linear combinations of some other param- 
eters p'i, i.e., 

p = Ap' (A5) 

for some matrix A. There are two such interesting exam- 
ples, involving w(9) and P{k), respectively. If we define 
p\ = w(9i), i.e., the angular correlation function amplitude 
in the i th angular bin, then A is given by 



1 

2^ 



P^cos^sin^Afl, 



(A6) 



where Pi is a Legendre polynomial and A9 is the width 
of the angular bins. If we define p\ = P(ki), i.e., the 3D 
power spectrum in the i th fc-bin, then A is given by 



K e {ki)kfA]nk, 



(A7) 



where Kg is given by equation ( |B2| ) and A In k is the width 
of the (logarithmic) fc-bins. (Throughout this subsection, 
we assume for simplicity the 9- or fc-bins are narrow enough 
to resolve any features in w(9) or P(k), and that there is 
no ^-binning, defining v? — Cg.) 

Using equation (|A5|), we can construct quadratic esti- 
mators p' to measure p' directly, without going through 
the intermediate step of measuring the angular power spec- 
trum p first. Writing S = ^2p[Pl by analogy with equa- 
tion (0), the new P-matrices are given in terms of the old 
ones by 

P^ = ]TA^P,. (A8) 



Using equations (q) and (Al) therefore shows that the new 
estimators are related to the old ones by 



5' = B'A f B - 1 ! 



(A9) 



Here B' is the B-matrix corresponding to the new pa- 
rameters p', and we use the same notation with primes ' 
for other matrices below. To obtain an intuitive under- 
standing for this relation, let us simplify things by using 
the choice B = DL _1 in place of our previous choice 
B = DF -1 / 2 , where L is the lower-triangular matrix 
obtained by Cholesky-decomposing the Fisher matrix as 
F = LL*. L can be viewed of as simply an alternate 
choice of square root of F. As described in Tegmark & 
Hamilton (1998), this choice has the same desirable prop- 
erties as B = DF -1 / 2 except that it gives asymmetric 
window functions (F 1 / 2 is symmetric whereas L is not). 
A straightforward calculation shows that F' = A*FA, so 
L' = A f L and equation ( [A9| ) reduces to 

p'^D'L'^A^DL- 1 )-^ 

= (D'L^A^A'CLD^p = D'D^p, (A10) 



a diagonal matrix. In other words, if we use the same 
number of £- values as there are bins (for 9 or k), with A 
an invertible square matrix, then the old estimators pi and 
the new estimators ^ are the exact same numbers except 
differently normalized! The normalization factors Ha and 
D,^ ; simply let us interpret the measurements as probing 
weighted averages of p and p', respectively. 

This shows that there is no fundamental difference be- 
tween measuring Cg, w{9), P{k) or some other linear trans- 
formation of the power spectr um with quadratic estima- 
tors of the form of equation (Al). Not only do they all 
contain the same information (keeping the P-matrices the 
same, two different p computed with different B-matrices 
are trivially related by p' = B'B _1 p), but even the B- 
matrices will be essentially the same if we decorrelate the 
measurements. This means that the rescaled C^-estimates 
shown in Figure R can alternatively be interpreted as 
decorrelated quadratic estimators of P(k), or as rescaled 
decorrelated quadratic estimators of w(9)\ The reason that 
this paper purports to measure Ct rather than w(9) is sim- 
ply that the window-functions for our estimators turn out 
to be narrow and well-behaved in ^-space, but wide and 
partially negative in 0-space. 

A. 3. The relation between different pair weightings 

In the companion paper by Connolly et al. (2001), the 
angular correlation function w(9) was measured with a 
different technique, using so-called Landy-Szalay (LS) es- 
timators. LS-estimators are also quadratic estimators, and 
in our notation co rres ponds to replacing the Q-matrix 
choice of equation (Al) by 



Q; 



NiPi 



(All) 



For the w(9)-ca.se, the P-matrices take the simple form 
Pjfe — 1 if the angular separation between pixels j and 
k falls in the angular bin around 9k, vanishing otherwise. 
The normalization constants Ni are simply the number 
of pixel pairs with angular separation in the i th angular 
bin, s o Ni = trP 2 . The estimators corresponding to equa- 
tion ( All ) are not simply related to those corresponding 
to equation (Al) since the C _1 -weighting is absent. They 
therefore do not contain the same cosmological informa- 
tion. However, they have two other very desirable proper- 
ties. The first is that the window matrix from equation (Q) 



W,: 



trpPfQj-] = JVitr fPiPj-] =%, 



(A12) 



the identity matrix. This means that the LS quadratic es- 
timators can be interpreted as exact measurements of w(9) 
with no smoothing whatsoever. The second advantage is 
computational speed. Since no time-consuming matrix in- 
versions are necessary, the LS-estimator of w{9) can be 
computed with many more pixels than would otherwise 
be feasible, probing the clustering down to far smaller an- 
gular scales than we have probed in this paper. Finally, it 
is worth noting that the los sless property of the quadratic 
estimators of equation (Al) breaks down on small scales 
where fluctuations become non-Gaussian, making the com- 
putationally superior LS-estimators preferable in this regime. 
The bottom line is that the C^-estimation used here and 
the LS-estimation of w(9) used in Connolly et al. (2001) 
are highly complementary approaches, being preferable on 
large and small angular scales, respectively. 
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APPENDIX B 

THE RELATION BETWEEN C e AND P(k) 

In this Appendix, we discuss the close relation between 
the angular power spectrum Ce that we have measured 
and the underlying 3D power spectrum P(k). The pur- 
pose is both to review their exact quantitative relation, 
and to provide qualitative intuition for this relation and 
how it is affected by mis-estimates of the radial selection 
function. As we will see, Ce can be interpreted as essen- 
tially a smoothed version of P(k) shifted horizontally and 
vertically on a log- log plot, and a mis-estimate of the mean 
survey depth would shift the power spectrum along lines 
of slope —3. 

B.l. The exact relation 

The angular power spectrum Ce is related to the 3D 
power spectrum P(k) by 



Ci = 



K e (k)P(k)k 2 dk 



(Bl) 



for a dimensionless integral kernel Ke{k) that depends on 
the radial selection function of the survey. As shown in 
Appendix A of Huterer et al. (2001), 



Ke(k) = -fi(ky 

IT 



(B2) 



where fe is the Bessel transform of the radial selection 
function f(r) as given by equation (|9j). Specifically, /(r) = 
g(r)h(r), where g(r) is the probability distribution for the 
comoving distance from us to a random galaxy in the sur- 
vey and h(r) is an optional (bias and clustering) evolution 
term of order unity, so g has units of inverse length and h 
is dimensionless. Defining N to be the expected number of 
galaxies within a sphere of a certain radius, we thus have 



. , dN dN/dz H(z)dN 
g(r) oc — = = — — — , 

dr dr/dz -fio r o dz 



normalized so that 



g(r)dr = 1. 



(B3) 



(B4) 



Here r = c/H « 3000/i 1 Mpc, and the relative Hubble 
parameter is 

H(z) 



Ho 



VflA + (1 - fi A - fi m )(l + Zf + On(l + Zf 



(B5) 

for a cosmology with density parameters Q m and Q\ for 
matter and vacuum energy, respectively. This means that 
uncertainties about dN/dz and uncertainties about the 
cosmological parameters (il m , £Ia) get combined, entering 
only via the single function g(r). 

The evolution term h(r) relates the past and present 
galaxy clustering amplitudes, and is given by 



h= [P(k;z)/P(k)} 1/2 



(B6) 



for a flat Universe. If space should turn out to be curved 
despite present evidence to the contrary, h gets multiplied 



by a correction factor as in Peebles (1980). The factor 
h is likely to remain close to unity for the low redshifts 
z < 0.5 probed by the SDSS, especially since the effects of 
bias evolution and dark matter clustering evolution appear 
to partially cancel (Blanton et al. 2000). The clustering 
evolution is expected to be small over this redshift range 
since linear growth grinds to a halt at recent times when 
vacuum energy becomes dominant. Rather than attempt- 
ing a complicated and poorly justified model for h(r), we 
therefore simply set h(r) = 1 and reinterpret the mea- 
sured P(k) as the power spectrum at the effective redshift 
corresponding tor~ £/k (Dodelson et al. 2001). 

In practice, we evaluate the Bessel-transform of equa- 
tion ffl) for 512 logarithmically equispaced k- and r-values 
using Fourier methods from the FFTlog package of Hamil- 
ton (2000). This is an efficient NlogN algorithm, evalu- 
ating all kernels up to £ = 1000 in about a minute on a 
workstation. Sample results are shown in Figure H using 
the selection function for the 21 < r < 22 band described 
by Dodelson et al. (2001) for a Universe with il m = 0.3, 
0\ = 0.7. 

B.2. The small-angle approximation 
The approximation 



K e (k) 



ek 2 



f 



(B7) 



becomes accurate in the small-angle limit (see, e.g., Kaiser 
1992; Baugh & Efstathiou 1994), as illustrated in Figure § 
This is the £-space version of Limber's equation, which re- 
lates P(k ) to the angular correlation function w(9). Equa- 
tion (B7) can be derived directly from equation (B2) by 
noting that for large £, the spherical Bessel function ji(kr) 
becomes sharply peaked around kr — £. Assuming that 
f(r) is a smoothly varying function relative to this peak 
width, we can thus approximate it by f(£/k) and take it 
out of the integral in equation (M) , obtaining 



//(*) « / 



j e (kr)dr 



\2k 2 £) J 



since 




for I > 1. 



je(x)dx 



B.3. k-space window functions 



(B8) 



(B9) 



Since our measured band powers probe linear combi- 
nations of the actual power spectrum coefficients Ce, and 
these in turn are linear combinations of P(k), we can rein- 
terpret our band-power measurements pi as probing P{k) 
directly. In other words, the window matrix W from equa- 
tion (Q) relates our measurements to Ce and the kernel 
Ke(k) of equation (Bl) relates Ce to P(k), so combining 
the two relates our measurements to P(k). Specifically, 
these two equations give 

/•OC 

(&)=/ y^WuKe{k)P{k)k 2 dk (B10) 

Jo t 

for the case of no ^-binning. Since we have binned our 
angular power spectrum in £-bins of width A£ = 20, the 
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sum over I in equation (B1C) gets replaced by a sum over 
bins and Kg( k) ge ts replaced by its average over each ^-bin. 
Equation ( B10| ) is seen to take the simple form (pi) = 
f Wi(k)P(k)dhxk for functions W,{k) = J2gW u K e {k)k 3 
that are never negative. Defining normalization constants 
Ci = J Wi(k)dhik, this means that we can interpret our 
measurements pi as probing simply a times weighted av- 
erages of P(k) with weight functions Wi(k)/ci. However, 
caution is necessary before using this fact to make plots 
like Figure 0. The reason is that if the window functions 
Wi(k)/ci are wide (which they are) and the function to 
be measured varies substantially on the scale of this win- 
dow (which -P(fc) typically does), then the weighted av- 
erage will be dominated by one edge of the window. For 
instance, in the regime of Figure fl where P(k) is rapidly 
falling, the integral J Wi(k)P(k)dln k would be dominated 
by the contribution from fc-values leftward of the peak of 
Wi(k), causing the corresponding point in Figure ffl to be 
plotted misleadingly far to the right. Such problems can 
be avoided by redefining the quantity to be measured to 
be a roughly constant function. This is why we chose to 
measure af rather than Cg above. Following, e.g., Eisen- 
stein & Zaldarriaga (2000) and Hamilton et al. (2000), we 
therefore interpret our measurements as weighted averages 
of the relative power spectrum, defined as P(k)/P*(k), 
where P,(k ) is our fiducial power spectrum described in 
Section [2.5| . This relative power will be a fairly constant 
function (of order unity) as long as the shape of our fidu- 
cial power spectrum is not grossly inconsistent with the 
truth. We therefore write 



{%)= I Wi(*)3idln/.-. 



P*{k) 
where we have defined window functions 

Wi(k) = P4k)k 3 Y,W a K e (k), 



(Bll) 



(B12) 



These functions are plotted in Figure |g for the faintest 
magnitude bin. This equation is analogous to equation (fn), 
linking our measurements to the 3D power spectrum P(k) 
rather than the angular spectrum Cg. The rescaled band- 
power coefficients pi/ci are thus weighted averages of the 
relative power spectrum, where c, = J Wi(k)dh\k as be- 
fore. The numbers (-P*(fe)/ c i)Pi can therefore be viewed as 
a measurements of P(k), and are plotted in Figure (7] at the 
fc-values corresponding to the means of the distributions 
Wi(k), with horizontal bars indicating the rms widths of 
Wi(k). The results are seen to be roughly consistent be- 
tween magnitude bins and in agreement with a standard 
ACDM power spectrum. 

B.4. The narrow window approximation and the poor 
man's Limber inversion 

Let us now make an approximation aimed at building 
qualitative intuition for how Cg is related to P(k) and, 
in particular, for how this relation depends on the details 
of the selection function f(r). Figure || shows that the 
kernels Kg (fc) from equation ( Bl ) are fairly narrow positive 



functions with a single peak, 
as 



Defining their n moments 



(k n )g= / Kg(k)k n dlnk, 



(B13) 



they are therefore roughly characterized by their areas Ag 
means kg and rms widths Akg given by 



A, 



(k°h 



(fci) 
A f 



and Akg 



(fc 2 



A, 



1/2 



respectively. In the crude approximation that the widths 
Akg of these curves are smaller than the scale on which 
the dimensionless power k 3 P(k) varies appreciably, we can 
approximate k 3 P(k) by kfP(kg) in the integral of equa- 
tion (Bl), obtaining simply 



Cr 



Kg(k)P{kg)k 3 l d\\ik = Agk\P{kg) 



(B15) 



Since P(k) is roughly a power law near any given fc, this 
approximation is accurate in the limit where Akg/kg -C 1. 
To highlight the scale dependence of the problem, let us 
define the mean comoving distance by 



f(r)rdr 



(B16) 



and the dimensionless probability distribution function 
(PDF) for x = r/r* by 



/*(«) = r*/(r»»). 



(B17) 



The PDF / thus has both area and mean of unity, and 
quantifies only the shape of the radial selection function, 
with r* encapsulating the physical depth of the survey. 
Since f(r) = f*(r/r*)/r*, sub stituting the s mall angle 
approximation of equation (B7) into equation (B13|) now 



gives 



(fc") 



a n £ 



n— 3 



where the dimensionless constants 



f*(x 



iV" 



n dx 



(B18) 



(B19) 



are all of orde r unity. Substituting equation ( B18| ) into 
equation (B14) now gives the simple results 



r* 

Akg = -j kg, 



Ae 

kg = 



(B20) 



where (3 = a%/ao and 7 = {olqUiI a\ — l) 1 ' 2 are again di- 
mensionless constants of order unity that depe nd on ly on 
the shape func tion /*. Substituting equation (p20) into 
equation ( p315|) thus gives the extremely simple formula of 
equation (p.0[), where a = «o/3 2 = af/a 2 ,. Equation (10) 
tells us that we can perform a "poor man's Limber inver- 
sion" by simply making a log-log plot of Cg and changing 
the axis labels to fc and P(k), respectively, making the 
substitutions 



* C 



P= C -. 

a 



(B21) 



The key caveat is that the resulting plot shows not the 
true -P(fc) but a smoothed version thereof, with a roughly 
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constant smoothing width A In k = 7 on our logarithmic 
fc-axis. 

For the SDSS selection functions described in Dodel- 
son et al. (2001), a « 0.75, (3 w 1.1 and the smooth- 
ing width 7 = Ak/k rs 0.37 for all four magnitude bins. 
Even major changes in the functional form of the radial 
selection function do not change these shape parameters 
by large amounts. In contrast, the mean survey depth 
varies substantially, with r* = 515/i _1 Mpc, 710/i _1 Mpc, 
947/i _1 Mpc and 1193/i _1 Mpc for the four magnitude bins, 
respectively, with non-negligible uncertainty (Dodelson et 
al. 2001). 

Since the relative windows wi dths Ak/k are so large, 
it is important to use equation ( B12J ) rather than equa- 
tion ( flC)| ) on the largest scales, where k 3 P(k) is far from 
constant. Moreover, the additional smearing A£ ~ 20 
caused by our finite sky coverage becomes important on 
large scales, since it corresponds to a rock-bottom smooth- 
ing scale Afc « j3A£/r lr that does not decrease with k. 

Perhaps the most useful feature of equation (Jig) is that 
it explicitly shows the effect of changing the radial selec- 
tion function. If the shape /* has been correctly estimated 
but the mean survey depth r* has been overestimated, 
then equation (fLOt) shows that the inferred power spec- 
trum P(k) will be too far up to the left — up because 
there was in fact less averaging along the line of sight sup- 
pressing the observed power Ci , to the left because a given 
angular scale £ in fact corresponds to larger spatial scale. 
Any such errors will therefore slide the entire P(k) curve 
along the solid lines of slope —3 shown in Figure []. 

Although the dominant uncertainty is likely to arise 
from the mean depth r*, the dependence on the shape 
(as opposed to the mean depth) of the selection function 
is also rather intuitive. If the galaxies are more concen- 
trated around their mean distance (if /* (x) is more sharply 
peaked around x = 1), then a straightforward calculation 
shows that the normalization a increases and the smooth- 
ing width 7 = Ak/k decreases. The first effect corresponds 
to less averaging down of fluctuations along the line of 
sight, increasing Ci for a given P(k). The second effect 
corresponds to less aliasing, since the relation between an- 
gular separation and transverse spatial separation tightens 
when the galaxies become less spread out radially. Since 7 
is essentially the width of the the radial selection function 
in units of the mean depth (more precisely, this ratio for 
the square of the selection function, which is more peaked 
and therefore gives a smaller number), it is difficult to 
obtain smearing jA/k < 25% for realistic selection func- 
tions. However, much smaller 7-values of course become 
possible if photometric redshifts are used to define galaxy 
samples in narrow radial bins. 
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18 < r < 19 


19 < r < 20 


20 < r < 21 


21 < r < 22 


£ 


o\ 


I 


°l 


£ 


a\ 


£ 


a\ 


13 ± 8 


-0.0006 ± 0.0028 


13 ± 8 


0.0002 ± 0.0011 


13 ± 8 


0.0004 ± 0.0005 


13 ± 8 


0.0008 ± 0.0003 


31 ± 15 


0.0115 ± 0.0048 


31 ± 15 


0.0053 ± 0.0021 


30 ± 15 


0.0017 ± 0.0010 


30 ± 15 


0.0009 ± 0.0006 


51 ± 18 


0.0132 ± 0.0059 


50 ± 18 


0.0067 ± 0.0028 


50 ± 18 


0.0034 ± 0.0015 


49 ± 19 


0.0013 ± 0.0008 


70 ± 20 


0.0185 ± 0.0064 


70 ± 20 


0.0105 ± 0.0031 


70 ± 20 


0.0088 ± 0.0017 


69 ± 21 


0.0043 ± 0.0010 


100 ± 24 


0.0192 ± 0.0048 


100 ± 24 


0.0128 ± 0.0024 


99 ± 24 


0.0075 ± 0.0013 


99 ± 24 


0.0052 ± 0.0008 


139 ± 25 


0.0261 ± 0.0052 


140 ± 25 


0.0107 ± 0.0025 


140 ± 25 


0.0069 ± 0.0014 


139 ± 25 


0.0045 ± 0.0009 


179 ± 25 


0.0229 ± 0.0056 


179 ± 25 


0.0133 ± 0.0027 


180 ± 25 


0.0079 ± 0.0015 


180 ± 25 


0.0039 ± 0.0010 


219 ± 26 


0.0272 ± 0.0061 


219 ± 25 


0.0179 ± 0.0028 


220 ± 25 


0.0103 ± 0.0015 


220 ± 26 


0.0052 ± 0.0010 


259 ± 26 


0.0357 ± 0.0067 


259 ± 26 


0.0114 ± 0.0030 


259 ± 25 


0.0087 ± 0.0016 


260 ± 26 


0.0040 ± 0.0010 


308 ± 29 


0.0353 ± 0.0060 


309 ± 29 


0.0205 ± 0.0027 


309 ± 29 


0.0098 ± 0.0014 


309 ± 29 


0.0054 ± 0.0009 


369 ± 30 


0.0544 ± 0.0067 


369 ± 30 


0.0291 ± 0.0030 


369 ± 29 


0.0133 ± 0.0015 


369 ± 30 


0.0067 ± 0.0009 


445 ± 40 


0.0545 ± 0.0066 


446 ± 39 


0.0268 ± 0.0027 


446 ± 37 


0.0144 ± 0.0013 


448 ± 39 


0.0086 ± 0.0008 










546 ± 37 


0.0165 ± 0.0016 


546 ± 38 
646 ± 39 


0.0083 ± 0.0009 
0.0117 ± 0.0011 



Table 1. The angular power spectrum a] 



, , — [£(£ + 1)Ci/2it] measured for the four magnitude bins. These measurements are uncorrelated in 
the approximation of Gaussian fluctuations. Although the power spectrum is by definition non-negative, the allowed ranges above can include 
slightly negative values since our estimators are the difference of two powers (total observed power minus expected shot noise power). 



